Microarray Leukemia Gene Data Clustering by Means of Generalized Self-organizing Neural Networks with Evolving Tree-Like Structures
Authors
Abstract
The paper presents the application of our clustering technique, based on generalized self-organizing neural networks with evolving tree-like structures, to complex cluster-analysis problems including, in particular, the sample-based and gene-based clustering of the Leukemia microarray gene data set. Our approach works in a fully unsupervised way, i.e., it uses unlabelled data and does not require the number of clusters to be predefined. This is particularly important in the gene-based clustering of microarray data, for which the number of gene clusters is unknown in advance. In the sample-based clustering of the Leukemia data set, our approach gives better results than those reported in the literature, which were obtained using a method that requires the cluster number to be defined in advance. In the gene-based clustering of the considered data, our approach generates clusters that are easily divisible into subclusters related to particular sample classes. This corresponds, in a way, to subspace clustering, which is highly desirable in microarray data analysis.
Similar resources
Modification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis
Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...
DNA Microarray Data Clustering Using Growing Self Organizing Networks
Recent advances in DNA microarray technology have allowed biologists to simultaneously monitor the activities of thousands of genes. To obtain meaning from these large amounts of complex data, data mining techniques such as clustering are being applied. This study investigates the application of some recently developed incremental, competitive and self-organizing neural networks (Growing Cell S...
A dynamically growing self-organizing tree (DGSOT) for hierarchical clustering gene expression profiles
MOTIVATION: The increasing use of microarray technologies is generating large amounts of data that must be processed in order to extract useful and rational fundamental patterns of gene expression. Hierarchical clustering technology is one method used to analyze gene expression data, but traditional hierarchical clustering algorithms suffer from several drawbacks (e.g. fixed topology structure; ...
Identification and Evaluation of Functional Modules in Gene Co-expression Networks
Identifying gene functional modules is an important step towards elucidating gene functions at a global scale. In this paper, we introduce a simple method to construct gene co-expression networks from microarray data, and then propose an efficient spectral clustering algorithm to identify natural communities, which are relatively densely connected sub-graphs, in the network. To assess the effec...
Data Complexity in Clustering Analysis of Gene Microarray Expression Profiles
The increasing application of microarray technology is generating large amounts of high dimensional gene expression data. Genes participating in the same biological process tend to have similar expression patterns, and clustering is one of the most useful and efficient methods for identifying these patterns. Due to the complexity of microarray profiles, there are some limitations in directly ap...
Publication date: 2015